Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
19k documents
Production Status:
Existing-used
Use:
Document Classification, Text categorisation
- Paper title: Word2Sense: Sparse Interpretable Word Embeddings
- Paper track: Long/Word-level Semantics
- Paper status: Accept
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Abhishek Panigrahi | 20-NewsGroups | /N |
Documentation:
Publicly available
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
100 entries
Production Status:
Existing-used
Use:
Word Sense Disambiguation
- Paper title: Word2Sense: Sparse Interpretable Word Embeddings
- Paper track: Long/Word-level Semantics
- Paper status: Accept
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Abhishek Panigrahi | Semeval 2010 word sense induction and disambiguation dataset | /N |
Documentation:
Publicly available
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
WaCKy
Size:
2 billion tokens
Production Status:
Existing-used
Use:
Topic Detection and Tracking
- Paper title: Word2Sense: Sparse Interpretable Word Embeddings
- Paper track: Long/Word-level Semantics
- Paper status: Accept
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Abhishek Panigrahi | ukWaC | /N |
Documentation:
Publicly available
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
WaCKy
Size:
800 million tokens
Production Status:
Existing-used
Use:
Topic Detection and Tracking
- Paper title: Word2Sense: Sparse Interpretable Word Embeddings
- Paper track: Long/Word-level Semantics
- Paper status: Accept
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Abhishek Panigrahi | WaCkypedia_en | /N |
Documentation:
Publicly available
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None
Production Status:
Newly created-finished
Use:
Summarisation
- Paper title: TalkSumm: A Dataset and Scalable Annotation Method for Scientific Paper Summarization Based on Conference Talks
- Paper track: Short/Summarization
- Paper status: Accept
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Michal Shmueli-Scheuer | TalkSumm | /N |
Documentation:
None
Text
Corpus,
Language Type:
Multilingual
Languages:
English French German Spanish
Availability:
Freely Available
License:
Size:
None
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Multilingual Unsupervised NMT using Shared Encoder and Language-Specific Decoders
- Paper track: Short/Machine Translation
- Paper status: Accept
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Sukanta Sen | WMT data | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
None
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
- Paper title: Unsupervised Information Extraction: Regularizing Discriminative Approaches with Relation Distribution Losses
- Paper track: Long/Information Extraction and Text Mining
- Paper status: Accept
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Étienne Simon | The New York Times Annotated Corpus | /N |
Documentation:
None
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
License:
Size:
~3000 entries
Production Status:
Newly created-finished
Use:
Knowledge Discovery/Representation
- Paper title: From Surrogacy to Adoption; From Bitcoin to Cryptocurrency: Debate Topic Expansion
- Paper track: Long/Sentiment Analysis and Argument Mining
- Paper status: Accept
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Roy Bar-Haim | DTE - Debate Topic Expansion Dataset | /N |
Documentation:
None
Lexicon,
Language Type:
Monolingual
Languages:
English
Availability:
License:
Size:
None words
Production Status:
Existing-used
Use:
- Paper title: From Surrogacy to Adoption; From Bitcoin to Cryptocurrency: Debate Topic Expansion
- Paper track: Long/Sentiment Analysis and Argument Mining
- Paper status: Accept
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Roy Bar-Haim | Opinion Lexicon | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
License:
Size:
None
Production Status:
Existing-used
Use:
Knowledge Discovery/Representation
- Paper title: From Surrogacy to Adoption; From Bitcoin to Cryptocurrency: Debate Topic Expansion
- Paper track: Long/Sentiment Analysis and Argument Mining
- Paper status: Accept
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Roy Bar-Haim | Wikipedia | /N |
Documentation:
None




